Classifying Salient Textual Entities in the Headlines and Captions of Grouped Bar Charts

Authors

  • Richard Burns
  • Sandra Carberry
  • Stephanie Elzer Schwartz
Abstract

Information graphics, such as grouped bar charts, generally have a communicative message that they are intended to convey when they appear in popular media. Communicative signals are typically designed into the graphic to help convey these intended messages to the viewer. We have designed and implemented a system that automatically hypothesizes the intended message of a grouped bar chart from communicative signals that are automatically extracted from the graph. Analysis of our system revealed that textual evidence, such as graph entities mentioned in the headline or caption of the graphic, was the most important type of evidence. This paper describes a support vector machine classifier that takes a graph and its headlines and captions and predicts whether an entity is linguistically salient.
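As a rough illustration of the kind of textual evidence the abstract describes, the Python sketch below checks whether each graph entity is mentioned in a headline or caption. The feature names, the `mention_features` helper, and the example data are all assumptions made for illustration; the abstract does not state which features the classifier actually uses. Feature vectors of this kind could then be passed to a support vector machine.

```python
import re


def mention_features(entity, headline, caption):
    """Hypothetical features describing how an entity label is mentioned in text."""
    # Case-insensitive whole-word match on the entity label.
    pattern = re.compile(r"\b" + re.escape(entity) + r"\b", re.IGNORECASE)
    in_headline = len(pattern.findall(headline))
    in_caption = len(pattern.findall(caption))
    return {
        "in_headline": int(in_headline > 0),
        "in_caption": int(in_caption > 0),
        "mention_count": in_headline + in_caption,
    }


# Made-up example data (not taken from the paper).
entities = ["Canada", "Mexico", "Brazil"]
headline = "Canada leads in exports"
caption = "Exports by country; Canada and Mexico shown in billions."

features = {e: mention_features(e, headline, caption) for e in entities}
```

In this made-up example, only "Canada" appears in both the headline and the caption, so a classifier trained on such features might treat it as the most likely salient entity.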


Similar resources

Automatically Recognizing Intended Messages in Grouped Bar Charts

Information graphics (bar charts, line graphs, grouped bar charts, etc.) often appear in popular media such as newspapers and magazines. In most cases, the information graphic is intended to convey a high-level message; this message plays a role in understanding the document but is seldom repeated in the document’s text. This paper presents our methodology for recognizing the intended message o...


What is being Measured in an Information Graphic?

Information graphics (such as bar charts and line graphs) are widely used in popular media. The majority of such non-pictorial graphics have the purpose of communicating a high-level message which is often not repeated in the text of the article. Thus, information graphics together with the textual segments contribute to the overall purpose of an article and cannot be ignored. Unfortunately, in...


Communicative Signals as the Key to Automated Understanding of Simple Bar Charts

This paper discusses the types of communicative signals that frequently appear in simple bar charts and how we exploit them as evidence in our system for inferring the intended message of an information graphic. Through a series of examples, we demonstrate the impact that various types of communicative signals, namely salience, captions and estimated perceptual task effort, have on the intended...


A New Statistical Approach for Recognizing and Classifying Patterns of Control Charts (RESEARCH NOTE)

Control chart pattern (CCP) recognition techniques are widely used to identify potential process problems in modern industries. Recently, artificial neural network (ANN)-based techniques have become very popular for recognizing CCPs. However, finding a suitable architecture for an ANN-based CCP recognizer and training it are time-consuming and tedious. In addition, because of the black box ...


Bar Charts Recognition Using Hough Based Syntactic Segmentation

Bar charts are common data representations in scientific and technical papers. In order to recognize printed bar charts, we present a new Hough-based bar chart recognition algorithm which combines syntactic analysis with segmentation. We first detect the most salient feature in any bar chart, bar patterns, using syntactic analysis in the Hough domain. Then we group text primitives according...




Publication date: 2015